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Abstract — In this paper, we study the problem of reducing the 
energy consumption in a mobile communication network; we 
select the smallest set of active base stations that can preserve 
the quality of service (the minimum data rate) required by the 
users. In more detail, we start by posing this problem as an 
integer programming problem, the solution of which shows the 
optimal assignment (in the sense of minimizing the total energy 
consumption) between base stations and users. In particular, this 
solution shows which base stations can then be switched off or 
put in idle mode to save energy. However, solving this problem 
optimally is intractable in general, so in this study we develop 
a suboptimal approach that builds upon recent techniques that 
have been successfully applied to, among other problems, sparse 
signal reconstruction, portfolio optimization, statistical estima- 
tion, and error correction. More precisely, we relax the original 
integer programming problem as a minimization problem where 
the objective function is concave and the constraint set is convex. 
The resulting relaxed problem is still intractable in general, 
but we can apply the majorization-minimization algorithm to 
find good solutions (i.e., solutions attaining low objective value) 
with a low-complexity algorithm. In contrast to state-of-the-art 
approaches, the proposed algorithm can take into account inter- 
cell interference, is suitable for large-scale problems, and can be 
applied to heterogeneous networks (networks where base station 
consume different amounts of energy). 

I. Introduction 

The information technology sector contributes to an in- 
creasingly portion of the world's energy consumption, and 
thus there is an urge to improve the energy efficiency in 
communication networks. By improving the energy efficiency, 
network operators also reduce operational costs because en- 
ergy constitutes a significant part of their expenditures. Recent 
studies [ 1 1, [2 1 have shown that there are large load fluctuations 
in time and space in mobile networks; the traffic demand is low 
at night time and high during working hours. Therefore, there 
is a huge potential to save energy by adapting the network 
to the demanded traffic. Unfortunately, current networks are 
typically configured to provide the best possible quality of 
service (QoS) by assuming that the largest expected traffic 
is always demanded. This assumption often implies that all 
base stations should be powered at all times, thus wasting 
too much energy because base stations are one of the most 



energy expensive components of a mobile cellular network 
(they consume over 50 % of the total energy budget pj). 

Against this background, current work has been considering 
to minimize the number of base stations to provide a given 
quality of service to users in order to save energy Q, (5). 
In particular, the work in [4] has proposed a scheme to 
minimize the energy consumption by optimizing the number 
of base stations and their locations. The problem is posed as 
mixed integer programming problem, and the authors suggest 
to solve it with the simplex method and the branch and 
bound algorithm. Although this scheme has been originally 
proposed to find a fixed, non-adaptive network configuration, 
it can be easily extended to the case where the network 
configuration has to be modified according to changes in traffic 
demands during the day. However, this work focuses on the 
time division multiple access (TDMA) protocol, so it does 
not consider, for example, inter-cell interference, one of the 
major problems in modern systems [6 |. In addition, algorithms 
based on branch and bound methods are known to run for a 
very long time, even with problems of moderate sizes J7). 
In contrast, the work in (5j has proposed centralized and 
decentralized algorithms to address specifically the problem 
of base station selection in the present of traffic fluctuations 
in the network. These algorithms are fast, but they are based 
on heuristics and do not consider networks where base stations 
have different power consumption (which, in particular, is the 
case of modern networks consisting of hierarchical structures). 
Furthermore, no analytical justification is provided to support 
the good performance of the algorithms, and the dynamic 
power consumption of the base stations is also not considered. 

To address the limitations of the above techniques, we 
propose an algorithm that tries to select the smallest number 
of base stations needed to provide a required data rate to all 
users in the system. In more detail, we model the base station 
selection problem as an integer programming problem, which 
is known to be intractable for large systems (thus we cannot 
expect to solve this problem optimally). Therefore, to find a 
fast (but not necessarily optimal) solution, we use ideas similar 
to those successfully applied in sparse optimization based on 
convex programming (7}-||9). In more detail, building upon 



the results in ||8J, [10], we relax the integer programming 
problem by posing it as the minimization of a concave 
function constrained to a convex set, and we obtain a base 
station configuration attaining low objective value by using the 
majorization-minimization (MM) algorithm JTTJ. In doing so, 
we are able to devise an algorithm that is fast, has an analytical 
justification for its good performance, can easily consider 
heterogeneous networks, and can take into account inter-cell 
interference and the transmitted power of base stations. 

The remainder of this paper is organized as follows. In 
Section|II]we present the system model. In Section III we show 
the optimization problem and derive the proposed algorithm, 



which is evaluated empirically in Section IV 



II. Scenario and System Model 

In this study, we consider a representative urban cellular 
network with a dense base station deployment. In more detail, 
we denote the set of all base stations as Ai — {1,2, ...,M} 
and the set of all users in a cellular radio network as 
Af = {1, 2, N}. As in J5J, the channel state information 
(CSI) of the channels, and hence their spectral efficiency, is 
available at the base stations. We also assume that the spectral 
efficiency is known for any link at a central unit, and there 
is no intra-cell interference (the latter assumption is common 
in network planning of modern systems (6)). To account for 
inter-cell interference, we assume the worst case interference 
when computing the spectral efficiency. More precisely, we 
approximate the spectral efficiency of the link from base 
station i to user j by 
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where P 

suitable scaling factors (known as the bandwidth efficiency and 
signal-to-interference-plus-noise ratio efficiency, respectively) 
|6); and Pi t j is the received signal power from base station i to 



user j, which is determined by the ITU log-distance path loss 
model with shadow fading for urban macro cell environments 

All base stations report their CSI and the QoS require- 
ments of the users to a central unit, which also knows the 
spatial traffic load distribution. All users have fixed QoS 
requirements represented by a minimum required data rate 
r = [ri, ra, . . . , r^v]. To support this data rate, base station 
i, to which user j is connected, has to allocate bandwidth 
bi j — -^7, where lu, - j is the ith row and jth column entry of 
the spectral efficiency matrix u> e jj MxW holding the spectral 
efficiency for all links. In addition to the QoS constraints for 
the users, each base station i has only a limited amount of 
bandwidth to allocate to its users. 

The problem we study in this paper is to find the set of 
base stations consuming the smallest amount of energy while 
providing the desired QoS level to each user. In the next 
section we formalize this problem and provide an efficient 
solution. 



III. Sparse Optimization for Increased Energy 
Efficiency 

Let X € {0, 1} x be a 0-1 matrix where Xij denotes its 
ith row and jth column. The parameter Xij is 1 if user j is 
connected to base station i and otherwise. To save energy 
by finding the minimum number of base stations necessary to 
provide the minimum QoS (data rate) to every user, we have 
to solve the following optimization problem: 
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where || • ||q denotes the ^o- norm (the number of non-zero 
elements) and 1 £ M. N denotes the vector of ones. (We 
assume that there is at least one feasible solution to problem 
(|2]i-(|5]l.) The parameter c is the static energy consumption 
per base statiorQ and the function /^^(X) is a suitable 
concave or convex function accounting for the dynamic energy 
consumption depending on the load at the base station. In 
the following, for the sake of simplicity, we neglect the 
dynamic part because, with current technology, the dynamic 
energy consumption is marginal compared to the static energy 
consumption for a typical base station pi. (This assumption 
also enables us to compare the proposed scheme with the cell 
zooming approach in (5}.) The constraints in ([3]) guarantee 
that users do not exceed the bandwidth available to the base 
stations, whereas the constraints in Q force every user to 
be connected to exactly one base station. Each row of X 
corresponds to a base station, so we seek solutions X* with 
as many rows having all-zero entries as possible. These rows 
correspond to the base stations that will be deactivated. Note 
that problem ([2]>-(j5]> is a combinatorial problem, and thus 
intractable if MN is large. Building upon the results in [8], 
we devise a low-complexity algorithm that tries to solve a 
problem strongly related to (|2j-([5j and that is able to provide 
matrices X with good row sparsity patterns. To do so, we first 
rewrite Q in a more convenient form for our purposes. 

For mathematical convenience, define w := vcc(X) =: 
[wi . . . wmn] T € M. MN where vec(-) is the operator 
vectorizing a matrix by stacking its columns. For any given 
h = [hi . . . /ia/] T G M. M , we can verify the following simple 
relation (8), (TO): 
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1 Here we assume that all base stations consume the same amount of power 
when active. However, in strong contrast to [5], our scheme can easily be 
adapted to the case where base stations consume different amounts of power. 



Therefore, problem |2]l-(|5]l (ignoring / WiI .(X) for the rea- 
sons described above) is equivalent to 
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is a matrix 



u> 4 e {0,1} 

where s { = vec(S 2 ) G M Mjv and S, 
of zeros, except for its ith row, which is a row of ones. For 
mathematical tractability, we relax the above problem by fixing 
e > and by using a convex relaxation of the last constraintjj 
In doing so, we obtain the following problem: 
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u>< e [0,1] i€{l,...,MJV}. (9) 

Alternatively, problem (|6]l-(|9]l (ignoring unnecessary con- 
stants) can be expressed more compactly as: 
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where A" C M MAr is the closed and convex set consisting 
of points satisfying the constraints in (|7j, ([8]), and ([91. The 
problem in (lOi (which is a relaxation of problem (|2])-([5]l) 
is the optimization problem we try to solve in this study. 
Unfortunately, it is still difficult to solve because we are 
minimizing a concave function. However, note that the concave 
objective function 
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is differentiable on X with gradient given by 
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V/(w) = J2 



As a result, we can use the majorization-minimization (MM) 
algorithm (shown in appendix [A} to find a sequence of vectors 

2 Here we say that two optimization problems are equivalent if the set of 
solutions is the same. 

3 The relaxation also gives rise to adaptions to coordinated multi-point 
transmission/reception (CoMP) scenarios, where users are allowed to connect 
to multiple base stations simultaneously. 



w( n ) with non-increasing objective value; i.e., / (w(™ +1 )) < 
/ (wW). For sufficiently large n we can expect to obtain 
solutions with good row sparsity patterns. More precisely, let 

.g(x,y) = /(y)+V/(y) T (x-y), 

be the majorizing function of / used by the MM algorithm. 
By doing so, the main iteration of the MM algorithm (which 



is obtained by substituting g above into ( 16 1 shown in the 
appendix) reduces to 
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where w' ' is an arbitrary, feasible vector. The computation 



of the sequence from (12i is the core task we perform in 
our algorithm (summarized in Alg. [TJ. Note that ( fl2] > is a 
linear programming (LP) problem and hence it can be solved 
efficiently. 



For the iteration in ( 12 1, given a small arbitrary value e* > 
0, we terminate the algorithm when either 



/ (w< n >) - / (w(" +1 >) 



< e* 



(13) 



is valid or when the maximum number of iterations n is 
reached. Note that, even if we do not set the maximum number 
of iterations, the algorithm eventually terminates because 
f (y/( n )\ converges (see the appendix). Upon termination, 
because of constraint (|9j, we may obtain solutions X* where 
some entries are in the interval (0, 1) (though most entries of 
the results shown in section [iV] are in {0, 1}). Unfortunately, 
simply rounding those entries to the set {0, 1} may lead to 
violations of the constraint in dTll. Hence, here we use the 
heuristic outlined in Alg. [2] The main idea is to connect the 
users with entries from {0, 1} first. Then we try to connect 
the remaining users, starting from those corresponding to large 
entries Wjj. New base stations are activated if the preceding 
operation fails. 

Algorithm 1 Network reconfiguration for improved energy 
efficiency 

Input: set of all users, set of all base stations, constraints 
Output: optimized network configuration 
1: initialize w' ^ with a feasible point 
repeat 



compute w by solving the LP in ( 12 1 
increment n 
update w 1 ™) := w 
until ( fjjf l is valid or n = n 

use a heuristic to map w w from [0,1] MJV into {0,1} 



MN 



8: connect the users to base stations according to w( n ). 
9: deactivate all base stations no user is connected to. 

Remark: The initialization of the MM algorithm is crucial 
to the overall performance. A bad initialization can lead to 
bad performance. We observed that a good starting point is to 



Algorithm 2 User assignment to base stations 

Input: solution X*, set of all users, set of all base stations, 

constraints 
Output: user assignment table 
1: assign users with entries Xij £ {0, 1} to the correspond- 
ing base stations. 
2: sort Xij with values in (0, 1) in descending order 
3: for all users j in the sorted set do 

4: assign user j to base station i corresponding to the 
largest entry Xi.j that does not lead to a bandwidth 
violation at base station i. 

5: end for 

6: for all users j not assigned to any active base station do 
7: activate the closest non-active base station and assign 

user j to it. 
8: end for 
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Fig. 1. Objective value and number of active base stations as a function of 
the number of iterations (for one realization). The dashed curve shows the 
objective in 111) . The solid line shows the corresponding number of active 
base stations for the result of each iteration. 



initialize the weights with a feasible solution w^ ) obtained 
by connecting each user to its closest base station. 

IV. Empirical evaluation 

The simulation environment is similar to that in (5), and 
it consists of 100 cells in a 10 by 10 hexagonal cell layout. 
To avoid boundary effects, we use a wrap around model. The 
inter cell distance is 500m, and all base stations have the same 
bandwidth limit B = 5MHz. Furthermore, we use c = 40014 7 
as the typical power consumption for a running base station. 
For simplicity all users have the same rate requirement r = 
122kb/s. New users are arriving according to a Poisson process 
with arrival rate A. To get a spatially fluctuating user pattern, 
we define three hotspots with high user density. Those hotspots 
are normally distributed in the area, and they have a radius of 
500m. New arriving users are dropped in a hotspot with a 
probability of 5% each. Their location within the hotspot is 
normally distributed around the center. The remaining area 
has a uniform user distribution (including the hotspot areas). 
The spectral efficiency of each link from all base stations to all 
users is calculated by ([TJ, and the signal powers follow the ITU 
propagation model for urban macro cell environments fl2l. 
Unless otherwise stated, we use for the LP in 
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and for the stopping criteria of the proposed algorithm n = 20 
and e* = 10~ 3 . 

In Fig. [T] we show that the proposed algorithm typically 
satisfies the stopping criterion ( p"3j ) before reaching the max- 
imum number of iterations. Note that, for visual clarity, we 
let the algorithm run for at least n iterations in Fig. [T] even if 
the alternative stopping criterion (13 1 is satisfied. For a fixed 



user arrival rate leading to a mean number of 400 users in 
the network, we have observed that, after six iterations, the 
objective value 



11 



improves only marginally. In addition to 
the objective value, we also plot in Fig. [T] the cardinality of 
the set of active base stations obtained at each iteration. It can 
be seen that it follows the same trend. 

Curves for other user arrival rates have shown a similar 
pattern. We have always observed a high decreasing rate of 



the objective value within approximately the first ten iterations 
and only a marginal decrease afterwards. More iterations have 
not improved the solution significantly in our scenario. 

In Fig. [2] we compare the proposed algorithm with the 
centralized cell zooming approach in [5|. The results are av- 
eraged over ten realizations for the same user arrival rate. The 
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Fig. 2. Cardinality of active base station set for different user arrival rates 
A needed to provide QoS to all users. The results are averaged over ten 
realizations for each user arrival rate and the height of the error bars shows 
the estimated error of the mean. The solid line shows our algorithm and the 
dashed line the cell zooming algorithm from j5j 

simulations show that the number of base stations increases 
as the user arrival rate increases. It can be observed that, 
compared with the cell zooming algorithm, our algorithm gives 
solutions that satisfy the user constraints and use fewer base 
stations. In addition, the proposed method has other major 
advantages over the cell zooming approach: 

• It can be extended to heterogeneous networks, where we 
have different power consumption for different network 
elements. 

• It can model the power consumption of the dynamic load, 
provided that it can be modeled as a concave or convex 
function. 



• It has a strong analytical justification for its good perfor- 
mance. 

V. Conclusions 

In this paper we have developed a novel approach to save 
energy in wireless networks by selecting a small number of 
active base stations that guarantees that the QoS (i.e., the 
minimum data rate) of all users is satisfied. We have shown 
that recent techniques that have been applied in, for example, 
compressed sensing can also be successfully applied in this ap- 
plication domain. In particular, the proposed algorithm needs 
to solve a simple LP at each iteration, so it can easily handle 
large-scale problems. Simulations show that good sparse solu- 
tions are obtained with few iterations. We have also shown that 
our technique can outperform recent methods such as the cell- 
zooming approach j5) in practical scenarios. Finally, in stark 
contrast with the work in [5 1, the proposed algorithm has other 
additional advantages such as a good analytical justification 
and the ability to consider heterogeneous networks and to 
consider the dynamic transmission power of base stations. 
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Appendix 

A. The majorization-minimization algorithm 

For convenience, in this section we review the MM algo- 
rithm. The presentation here is based on flO) , pT[ . 

Suppose that we want to minimize a function / : X —¥ M, 
where X C M. N . In particular, in this study we assume 
that all optimization problems have a solution; i.e., there 
exists x* e X satisfying x* e argmin xS ^ /(x). Unless 
the optimization problem has a very special structure that 
can be exploited (e.g., convexity), finding such a point x* is 
computationally intractable in general, so we have to content 
ourselves with generating a sequence of vectors with non- 
increasing objective value, as explained below. 

A standard means of attaining small objective values 
is to apply the majorization-minimization (MM) technique 
[ fTT) , which is a generalization of the celebrated expectation- 
maximization (EM) algorithm. In more detail, the MM algo- 
rithm is an iterative approach that tries to find a minimum of 
/ by minimizing at each iteration a surrogate function that i) 
majorizes / at every point in X and that ii) is tangent to / at 
the current estimate of a minimizer. More precisely, to apply 
the MM algorithm, we first need a function g : X x X — >• R 



satisfying the following (see also |10|): 
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Then, starting from x( ) € X, the MM algorithm produces a 
sequence G X (n G N) by 

,(«+!) 
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From the above, we see that the function g should be suffi- 
ciently structured in order to make the optimization problem 
in ( 16 1 easy to solve with efficient numerical approaches. In 



particular, if / is concave and differentiable, a natural choice 
for g is 



<?(x,y) = /(y)+V/(y) T (x-y), 



(17) 



in which case the optimization problem in ( [To} becomes a 
convex optimization problem provided that A" is a convex set. 
This particular choice is common in, for example, sparse signal 
recovery (8j. 

More generally, irrespective of the choice of / satisfying 
properties (14i and (15 1, we can easily verify that f{*S n >) is 



a monotone decreasing sequence: 
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(n+l) „(n+l)> 



< 3 (x( n+1 \x(")) < 9(xW,x("») = /(x< n >), 

where the equalities follow from ( [15) , and the two inequalities 
follow from ( 14 1 and ( 16 1, respectively. As a result, f(x^ n ' ) — > 



ceM for some c > /(x*) as n — > oo (which in general does 
not imply the convergence of the sequence x' n )). 
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